An application of VIM, the R package for visualization of missing values, to EU-SILC data

نویسنده

  • Matthias Templ
چکیده

Package VIM allows to explore and to analyze the structure of missing values in data, as well as to produce high-quality graphics for publications. This paper illustrates an application of VIM to a highly complex data set – the European Statistics on Income and Living Conditions (EU-SILC). 1 The graphical user interface of VIM The graphical user interface (GUI) has been developed using the R package tcltk [R Development Core Team, 2009] and allows easy handling of the functions included in package VIM. Figure 1 shows the GUI, which pops up automatically after loading the package. > library(VIM) If the GUI has been closed, it can be reopened with the following command. All selections and settings from the last session are thereby recovered. > vmGUImenu() For visualization, the most important menus are the Data, the Visualization and the Options menus. 1.1 Handling data The Data menu allows to select a data frame from the R workspace (see Figure 2). In addition, a data set in .RData format can be imported from the file system into the R workspace, which is then loaded into the GUI directly. Figure 1: The VIM GUI and the Data menu.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Visualization of imputed values using the R-package VIM

The package VIM (visualization and imputation of missing values) [Templ et al., 2011a] is developed to explore and analyze the structure of missing or imputed values in data using graphical methods. Getting knowledge about the structure is helpful to identify the mechanism, which is generating the missings, respectively errors, which may have happened in the imputation process. Furthermore, it ...

متن کامل

Visualization of missing values using the R-package VIM

This paper introduces new tools for the visualization of missing values. The tools can be used for exploring the data and the structure of the missing values. Depending on this structure, the tools can be helpful for identifying the mechanism generating the missings. This knowledge is important for selecting an appropriate imputation method to reliably estimate the missing values. The visualiza...

متن کامل

Exploring incomplete data using visualization techniques

Visualization of incomplete data allows to simultaneously explore the data and the structure of missing values. This is helpful for learning about the distribution of the incomplete information in the data, and to identify possible structures of the missing values and their relation to the available information. The main goal of this contribution is to stress the importance of exploring missing...

متن کامل

Applications of Statistical Simulation in the Case of EU-SILC: Using the R Package simFrame

This paper demonstrates the use of simFrame for various simulation designs in a practical application with EU-SILC data. It presents the full functionality of the framework regarding sampling designs, contamination models, missing data mechanisms and performing simulations separately on different domains. Due to the use of control objects, switching from one simulation design to another require...

متن کامل

Simulation of EU-SILC Population Data: Using the R Package simPopulation

This vignette demonstrates the use of simPopulation for simulating population data in an application to the EU-SILC example data from the package. It presents a wrapper function tailored specifically towards EU-SILC data for convenience and ease of use, as well as detailed instructions for performing each of the four involved data generation steps separately. In addition, the generation of diag...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009